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Abstract. Tiling recognizable two-dimensional languages, also known as REC, 
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generalize recognizable string languages to two dimensions and share with them 



O ■ several theoretical properties. Nevertheless REC is not closed under complemen- 



tation and the membership problem is NP-complete. This implies that this family 
REC is intrinsically non-deterministic. The natural and immediate definition of 
£T) \ unambiguity corresponds to a family UREC of languages that is strictly contained 

in REC. On the other hand this definition of unambiguity leads to an undecid- 
ability result and therefore it cannot correspond to any deterministic notion. We 
introduce the notion of line-unambiguous tiling recognizable languages and prove 
that it corresponds or somehow naturally introduces different notions of dctcrmin- 
■ ism that define a hierarchy inside REC. 

A picture (or two-dimensional string) is a two-dimensional arrays of symbols 
from a finite alphabet. A set of pictures is called two-dimensional language. Basic 
notations and operations can be extended from string to pictures. The size of a 
picture p is a pair (m, n) corresponding to the number of its rows and columns, 
respectively. Moreover there can be defined an operation of column-concatenation 
between pictures with the same number of rows and of row-concatenation between 
pictures with the same number of columns. By iteration, there can be also defined 
the corresponding row- and column- star operations. 

The first generalization of finite-state automata to two dimensions can be at- 
tributed to M. Blum and C. Hewitt who in 1967 introduced the notion of a four-way 
automaton moving on a two-dimensional tape as the natural extension of a one- 
dimensional two-way finite automaton (see [7]). They also proved that the deter- 
ministic version corresponds to a language class smaller than the corresponding one 
defined by the non-deterministic model. Four-way automata was not a successful 
model since the corresponding language class does not satisfies important properties 
as closure under concatenation and star operations. Since then, many approaches 
have been presented in the literature in order to find the "right way" to generalize 
in 2D what regular languages are in one dimension: finite automata, grammars, 
logics and regular expressions (see for example [H HU [251 HH EI])- Here we focus 
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on the family REC of tiling recognizable picture languages (see [HI [T5] that have 
been widely investigated and that it is considered as a valid candidate to represent 
a counter part to 2D of regular string languages. 

The definition of REC takes as starting point a characterization of recognizable 
string languages in terms of local languages and projections (cf. [H]). A picture 
language L is local if it is defined by a finite set of 2 x 2 pictures, called tiles that 
represent all allowed sub-pictures of size (2,2) for pictures in L. A pair composed 
by a local language over an alphabet V and an alphabetic projection n : Y — > £ 
is called tiling system. A picture language L over an alphabet £ is recognized by 
a tiling system (given by a local language V and n) if each picture p G L can be 
obtained as projection of a picture p' G V ( i.e. p = ir(p'))- A picture language 
is tiling recognizable if it is recognized by a tiling system. REC is the family of 
tiling recognizable picture languages. We point that languages of infinite picture 
(w-pictures) were also studied in the setting of tiling systems in (TJ [12j [13] . 

It can be verified that REC is closed under union and intersection, rotation and 
mirror and under column- and row- concatenation and star operations. Moreover, 
the definition of REC in terms of tiling systems turns out to be very robust: in [151 
[T7] it is shown that the family REC has a characterization in terms of logical formulas 
(a generalization of Biichi's theorem for strings to 2D). In [19J, it is proved that 
REC has a counterpart as machine model in the two-dimensional on-line tessellation 
acceptor (OTA) introduced by K. Inoue and A. Nakamura in [18]. Other models of 
automata for REC are proposed in [UE1E3]. Tiling systems can be also simulated by 
domino systems [19] and Wang tiles [10] and grammars [9] . Further we remark that 
when pictures degenerate in strings (i.e. when considering only one-row pictures) 
recognizability by tiling systems corresponds exactly to recognizability by finite state 
string automata. 

A crucial difference with the one-dimensional case lies in the fact that the defini- 
tion of recognizability by tiling systems is intrinsically non-deterministic. Determi- 
nistic machine models to recognize two-dimensional languages have been considered 
in the literature: they always accept classes of languages smaller than the corre- 
sponding non-deterministic ones (see for example, [3 [HI [26]). This seems to be 
unavoidable when jumping from one to two dimensions. Further REC family is not 
closed under complementation and therefore the definition of any constraint to force 
determinism in tiling systems should necessary result in a class smaller than REC. 
Strictly connected with this problems are the complexity results on the recognition 
problem in REC. Let L be a language in REC defined by a tiling system composed 
by a local picture language V and a projection it. To recognize that a given picture p 
of m rows and n columns belongs to L, one has to "rewrite" symbols in all positions 
in p to get a local picture p' that belongs to V and such that 7r(p') = p. This can be 
done by scanning all positions of p in some order. The non-determinism implies that, 
once reached a given position one may eventually backtrack on all positions already 
visited, that is on 0(mn) steps. Moreover in [21] it is proved that the recognition 
problem for REC languages is NP-complete. 

In formal language theory, an intermediate notion between determinism and non- 
determinism is the notion of unambiguity. In an unambiguous model, we require that 
each accepted object admits only one successful computation. Both determinism 
and unambiguity correspond to the existence of a unique process of computation, 
but while determinism is a "local" notion, unambiguity is a fully "global" one. 
Unambiguous tiling recognizable two-dimensional languages have been introduced in 
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[H], and their family is referred to as UREC. Informally, a picture language belongs 
to UREC if it admits an unambiguous tiling system, that is if every picture has a 
unique pre- image in its corresponding local language. In [5], the proper inclusion 
of UREC in REC is proved but it is also proved that it is undecidable whether a 
given tiling system is unambiguous. From a computational side, there are not known 
algorithms to recognize pictures that exploit the properties of UREC. This implies 
that, at each step of the recognition computation, it can be necessary to backtrack 
on all already visited positions. 

A relevant goal is then to find subclasses for REC that inherit important prop- 
erties but also allow feasible computations. Moreover an interesting result would be 
proving that, as for regular string languages, notions of some kind of unambiguity 
and determinism coincide. 

Remark that another difference between unambiguity and determinism is that 
determinism is always related to a scanning strategy to read the input. In the string 
case the scanning is implicitly assumed to be left-to right and in fact determini- 
stic automata are defined related to this direction. Moreover since deterministic, 
non-ambiguous and non-deterministic models are all equivalent there is no need to 
consider determinism from right-to-left (referred to as co-determinism). Neverthe- 
less it is worthy to remark that not all regular string languages admits automata 
that are both deterministic and co-deterministic. In the two-dimensional case we 
have to consider all the scanning directions from left, right, top and bottom sides. 

By exploiting the different possibilities of scanning for a two-dimensional array 
in [31 E] there are introduced different notions of unambiguity we call here line- 
unambiguity where a line can be either a column or a row or a diagonal. We consider 
tiling systems for which the computations to recognize a given picture can have at 
each position a backtracking on at most m + n steps. Such definitions lie between 
those of unambiguity and determinism (as long as we consider that a deterministic 
computation has zero backtracking steps at each position) while they all coincide 
with determinism when pictures degenerate in strings. 

The informal definitions are very simple and natural. A tiling system is column- 
unambiguous if, when used to recognize a picture by reading it along a left-to-right or 
right-to left direction, once computed a local column, there is only one possible next 
local column. As consequence in a computation by a column-unambiguous tiling 
system to recognize a picture with m rows, the backtracking at each step is at most 
of m steps. Similarly there are defined row-unambiguous and diagonal-unambiguous 
tiling systems corresponding to computations that proceed by rows or by diagonals, 
respectively. The corresponding families of languages are denoted by Col-UREC, 
Row- UREC and Diag- UREC. In [31 [2] there are proved necessary conditions for a 
language to be in Col-UREC and in Row-UREC. Using such conditions one can 
show that families Col-UREC and Row-UREC are strictly contained in UREC. In 
a different set-up it is also shown that Diag-UREC is strictly included both in Col- 
UREC and Row-UREC. Moreover all those properties are decidable. 

Very interestingly we can prove that diagonal-unambiguous tiling systems are 
equivalent to some deterministic tiling systems where the uniqueness of computation 
is guaranteed by certain conditions on the set of local tiles: the corresponding lan- 
guage family is denoted by DREC ([3])- Similar results hold for classes Col-UREC 
and Row-UREC whose union turns to be equivalent to another "deterministic" class 
named Snake-DREC [23]. All those classes are closed under complementation pi 123]. 
As result, when we consider this line unambiguity we can prove equivalence with 
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deterministic models and therefore we guarantee a recognition algorithm linear in 
the size (i.e. number of rows times number of columns) of the input. 
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